Multithreading and Thread Migration Using Mpi and Myrinet

نویسنده

  • Stephen Jenks
چکیده

The balance between CPU speed and interconnection network throughput in distributed memory parallel computers varies with each generation of systems, but the trend is that CPUs are gaining performance faster than the interconnection networks. This means that remote data accesses are becoming more expensive relative to local accesses in terms of CPU cycles. Therefore, remote memory access mechanisms that were suited to a previous generation of parallel machines may be less appropriate for current clusters. This research evaluates a multithreaded programming paradigm with cached remote memory accesses and thread migration to exploit array locality on a cluster with Myrinet. The approach, called Nomadic Threads, was originally developed for the CM5, but has been adapted to use MPI on Linux clusters. The results show that the current surfeit of CPU power vs. network throughput dramatically changes scaling characteristics of some programs while others behave much as they did on the decade-old CM5.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance evaluation of DECK combining multithreading and communication on Myrinet and SCI clusters

This paper presents a performance evaluation of DECK (Distributed Execution and Communication Kernel), a multithreaded parallel programming environment for clusters of SMPs, with the parallel implementation of the classical Mandelbrot fractal generation and Laplace’s Equation algorithms. The applications have been run on Myrinet and SCI clusters and the results are compared to corresponding MPI...

متن کامل

CCJ: object-based message passing and collective communication in Java

CCJ is a communication library that adds MPI-like message passing and collective operations to Java. Rather than trying to adhere to the precise MPI syntax, CCJ aims at a clean integration of communication into Java’s object-oriented framework. For example, CCJ uses thread groups to support Java’s multithreading model and it allows any data structure (not just arrays) to be communicated. CCJ is...

متن کامل

Test suite for evaluating performance of multithreaded MPI communication

As parallel systems are commonly being built out of increasingly large multicore chips, application programmers are exploring the use of hybrid programming models combining MPI across nodes and multithreading within a node. Many MPI implementations, however, are just starting to support multithreaded MPI communication, often focussing on correctness first and performance later. As a result, bot...

متن کامل

Implementing MPI in a High - Performance , Multithreaded Language

We have ported the MPICH implementation of MPI to the high-performance, multithreaded programming language MPC++. We discuss our modications to the design of MPICH to support multiple threads. MPICH now runs experimentally on top of MPC++ on a Sun workstation cluster connected by Myrinet and achieves higher performance than standard MPICH on Myrinet TCP/IP on the same hardware.

متن کامل

Parallel Quicksort Implementation Using Mpi and Pthreads

This report describes the approach, implementation and experiments done for parallelizing sorting application using multiprocessors on cluster by message passing tool (MPI) and by using POSIX multithreading (Pthreads). Benchmarking using 1MB sort and Minute sort were done .Pivot optimization techniques are also discussed for worst case scenarios in quicksort.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004